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Introduction 

Our  awarded  research  project  is  to  engineer  the  next  generation  of  novel  anticancer  bio¬ 
agents  that  will  target  and  eradicate  a  broad  spectrum  of  solid  tumors  including  breast  cancer. 
Our  research  design  was  based  on  the  hypotheses  that  (1)  the  next  generation  of  bio-agents 
could  be  engineered  to  continuously  produce  doses  of  an  anticancer  drug  (specifically  FK228) 
at  the  sites  of  bacterial  spore  germination  and  vegetative  growth  in  the  hypoxic/necrotic  regions 
of  solid  tumors,  and  (2)  that  the  synergistic  actions  of  bacterial  consumption  of  tumor  mass  and 
anticancer  drug  activities  could  destroy  many  kinds  of  solid  tumors  regardless  of  cancer 
genotype,  without  the  need  for  prodrug  injections  and  could  overcome  the  common  side  effects 
and  less  efficacy  of  current  cancer  therapies. 

The  project  has  three  specific  aims,  corresponding  to  three  tasks  defined  in  the  State  of 
Work,  with  each  aim/task  to  be  accomplished  in  each  of  the  three-contract-year  duration.  Our 
research  in  the  first  contract-year  (2008-2009)  had  achieved  the  first  two  slightly  modified 
aims/tasks:  (1)  we  reconstituted  the  FK228  biosynthetic  gene  cluster  and  other  necessary  DNA 
elements  onto  three  compatible  plasmids,  and  (2)  upon  introduction  of  those  plasmids  into  E. 
coli  cells,  the  engineered  E.  coli  strain  was  able  to  express  the  genes  and  produced  FK228 
under  both  aerobic  and  anaerobic  fermentation  conditions. 

The  project  goal  of  the  second  contract-year  (2009-2010)  was  to  integrate  the  FK228  gene 
cluster  into  E.  coli  chromosome  for  stable  functioning  without  antibiotic  selection  (hence  to 
create  marker-free  genetically  modified  organism  -  GMO,  as  required  by  FDA  regulations). 

Key  Research  Status 

1.  Reconstitution  of  a  complete  FK228  biosynthetic  gene  cluster  on  a  single  vector  -  mini  Hi  mar 
transposon 

In  order  to  forcefully  integrate  a  functional  FK228  biosynthetic  gene  cluster  into  E.  coli 
chromosome,  first  it  is  necessary  to  reconstitute  the  entire  biosynthetic  gene  cluster  on  a  single 
vector.  We  chose  miniHimar  RBI  transposon  as  the  vector  for  this  purpose  because  it  has  been 
reported  to  be  able  to  integrate  randomly  into  the  chromosome  of  a  broad  range  of  bacterial 
species  (Bouhenni,  Gehrke  et  al.  2005;  Choi  and  Kim  2009). 

We  employed  complicated  DNA  cloning  schemes  (details  omitted  for  simplicity)  and 
eventually  reconstituted  a  complete  FK228  biosynthetic  gene  cluster  from  the  three  plasmids 
made  during  the  first  year  of  research.  This  gene  cluster  includes  sequentially  a  left-end 
inverted  repeat  sequence  (IR),  a  removable  FRT  cassette,  an  interchangeable  promoter 
responsive  to  anaerobic  condition,  a  stretch  of  dep-genes  ( depR ,  depO,  depP,  depM,  depN, 
depA  through  depJ),  a  gfp  reporter  gene,  and  a  right-end  inverted  repeat  sequence: 

0  5  10  15  20  25  30  35  40  kb 
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2.  Unsuccessful  attempts  to  integrate  the  engineered  FK228  gene  cluster  into  E.  coli 
chromosome 

We  attempted  numerous  times  to  introduce  and  integrate  the  FK228  gene  cluster-carrying 
transposon  into  E.  coli  cells  by  transformation  or  electroporation  under  different  conditions 
(varying  amount  of  DNA,  duration  of  treatment,  temperature,  antibiotic  selection  strength,  etc.). 
The  transformation  efficiency  was  generally  extremely  low  (<10'9  transformants  per  ug  DNA).  A 
few  transformants  were  indeed  obtained  after  extended  period  of  incubation  at  37°C.  However, 
upon  analysis  of  the  integrated  DNA  by  overlapping  PCR  amplification,  we  found  that  the  FK228 
gene  cluster  was  either  partially  deleted  or  rearranged.  Consequentially  no  FK228  was  detected 
by  LC-MS  from  the  fermentation  broth  of  any  E.  coli  transformants. 

We  attributed  the  failure  to  the  large  size  of  the  DNA  construct  (~45  kb  total)  and  perhaps 
two  known  internal  homologous  regions  within  the  gene  cluster. 

Facing  this  insurmountable  technical  difficulty,  we  had  to  decide  to  suspend  this  experiment. 
Meanwhile  we  searched  and  conceived  an  alternative  project  design,  which  may  circumvent  the 
above  technical  difficulty  (see  below). 


3.  Alternative  Project  Design:  Engineering  and  integrating  a  small  tryprostatin  B  biosynthetic 
gene  cluster  (~9  kb)  into  E.  coli  chromosome  and  anaerobic  bacterial  chromosome 

Tryprostatin  A  and  tryprostatin  B  are  natural  products  of  a  fungal  species  Aspergillus 
fumigatus  BM939,  with  reported  bioactivity  as  potent  inhibitors  of  mammalian  cell  cycle  (Cui, 
Kakeya  et  al.  1995;  Cui,  Kakeya  et  al.  1996;  Cui,  Kakeya  et  al.  1996)  and  as  inhibitors  of  a 
breast  cancer  resistance  protein  (BCRP)  (Woehlecke,  Osada  et  al.  2003;  Jain,  Zhang  et  al. 
2008).  A  gene  cluster  responsible  for  the  biosynthesis  of  tryprostatins  has  been  characterized 
(Kato,  Suzuki  et  al.  2009;  Maiya,  Grundmann  et  al.  2009)  and  a  model  of  biosynthesis  is 
proposed  as  such: 


Tryprostatin  B  is  made  from  L-Trp  and  L-Pro  by  two  steps  of  reaction  catalyzed  by  FtmA 
(encoded  by  a  6.6-kb  gene  ftmA)  and  FtmB  (encoded  by  a  1.6-kb  gene  ftmB).  Therefore  it  takes 
a  very  small  gene  cluster  (promoter  plus  ftmA  and  ftmB ,  about  9  kb  DNA)  to  sufficiently  produce 
an  anticancer  compound  tryprostatin  B.  Two  additional  enzymes  (a  cytochrome  P450  encoded 
by  ~1.7  kb  DNA  and  O-methyltransferase  encoded  by  -1.0  kb  DNA,  respectively)  can  further 
convert  tryprostatin  B  to  tryprostatin  A. 

We  have  obtained  the  fungal  strain,  have  designed  and  initiated  this  alternative  project: 

(1)  To  clone  both  ftmA  and  ftmB  genes  under  an  fnr  promoter  into  a  cloning  vector  pSP72; 

(2)  To  verify  the  gene  fidelity  by  re-sequencing; 

(3)  To  subclone  the  entire  ftm  gene  cluster  (-9  kb)  into  the  miniHimar  RBI  transposon; 

(4)  To  attempt  to  integrate  the  ftm  gene  cluster  into  E.  coli  chromosome; 

(5)  Once  we  have  achieved  those  milestones,  we  will  exam  the  productivity  of  tryprostatin  B 
by  the  engineering  E.  coli  strain.  Should  the  result  be  positive,  we  will  attempt  to  introduce  and 
integrate  the  gene  cluster  into  anaerobic  Bifidobacterium  logum  chromosome; 

(6)  We  will  then  evaluate  the  antitumor  activity  of  engineered  bio-agents. 
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Reportable  Outcome 

A  manuscript  has  been  favorably  reviewed  and  is  under  revision  for  publishing  in  Applied 
and  Environmental  Microbiology  (see  Appendix). 

Summary 

We  have  achieved  partial  success  toward  engineering  novel  antitumor  bio-agents  by 
reconstituting  and  expressing  the  FK228  biosynthetic  gene  cluster  in  E.  coli  cells.  Efforts  to 
integrate  the  gene  cluster  into  E.  coli  chromosome  have  suffered  serious  setback  due  to 
insurmountable  technical  difficulty.  For  this  reason  we  have  designed  and  initiated  an  alternative 
project  that  will  employ  a  much  smaller  gene  cluster;  this  gene  cluster  is  responsible  for  the 
biosynthesis  of  tryprostatin  B,  a  potent  inhibitor  of  breast  cancer  resistance  protein  (BCRP). 

We  have  confidence  to  achieve  the  project  goal  in  about  two  years  from  now.  Therefore  I 
here  request  a  non-budgetary  extension  of  the  entire  project  duration  to  end  by  09/14/2012. 
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ABSTRACT 


2 

3  Functional  cross-talk  between  fatty  acid  biosynthesis  and  secondary  metabolism  has 

4  been  discovered  in  several  cases  in  microorganisms,  none  of  them  however  involves  a 

5  modular  polyketide  biosynthetic  enzyme.  Previously  we  reported  a  hybrid  modular 

6  nonribosomal  peptide  synthetase-polyketide  synthase  (PKS)  pathway  for  the  biosynthesis 

7  of  FK228  anticancer  depsipeptide  in  Chromobacterium  violaceum  no.  968.  This  pathway 

8  contains  two  PKS  modules  on  the  DepBC  enzymes  that  lack  a  functional  acyltransferase 

9  (AT)  domain,  and  no  apparent  AT-encoding  gene  exists  within  the  gene  cluster  or  its 

10  vicinity.  We  report  here  that,  through  reconstitution  of  the  FK228  biosynthetic  pathway  in 

1 1  E.  coli  cells,  two  essential  genes,  fabDl  and  fabD2,  both  encoding  a  putative  malonyl  CoA 

12  acyltransferase  component  of  the  fatty  acid  synthase  complex,  are  positively  identified  to  be 

13  involved  in  FK228  biosynthesis.  Either  gene  product  appears  sufficient  to  complement  the 

14  “AT-less”  PKS  modules  on  DepBC  for  polyketide  chain  elongation.  Concurrently  a  gene 

15  ( sfp )  encoding  a  putative  Sfp-type  phosphopantetheinyltransferase  was  identified  to  be 

16  necessary  for  FK228  biosynthesis  as  well.  Most  interestingly,  an  engineered  E.  coli  strain 

17  carrying  all  necessary  genetic  components  produced  significant  levels  of  FK228  under  both 

18  aerobic  and  anaerobic  cultivation  conditions.  Discovery  of  the  traits  complementation  of 

19  modular  PKSs  by  housekeeping  ATs  reveals  natural  product  biosynthesis  diversity. 

20  Moreover,  demonstration  of  anaerobic  production  of  FK228  by  an  engineered  facultative 

21  bacterial  strain  validates  our  effort  toward  engineering  of  novel  tumor-targeting  bio- 

22  agents. 

23 
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INTRODUCTION 


2  Polyketides  and  nonribosomal  peptides  constitute  two  large  families  of  natural  products  that 

3  are  biosynthesized  by  polyketide  synthases  (PKSs)  or  nonribosomal  peptide  synthetases 

4  (NRPSs),  respectively  (16,  24).  There  are  also  increasing  number  of  natural  products  identified 

5  as  hybrid  polyketide-nonribosomal  peptide  molecules  that  are  biosynthesized  by  a  combination 

6  of  PKS  and  NRPS  enzymes  (14,  29).  PKSs  function  by  carrying  out  successive  rounds  of 

7  decarboxylative  Claisen  condensation  of  short  carboxylic  acid  moieties  derived  from  malonyl 

8  coenzyme  A  (CoA),  methylmalonyl  CoA  and  a  few  other  less  common  acyl  CoAs  (5,  34).  In 

9  parallel,  NRPSs  condense  amino  acids  or  amino  acid  derivatives  by  forming  amide  bond 

1 0  between  two  building  blocks  (15). 

1 1  PKSs  share  many  similarities  with  fatty  acid  synthases  (FASs)  and  can  be  largely  classified 

12  as  type  I,  II  and  III,  according  to  their  architecture  and  mode  of  catalysis;  yet  many  variants  of 

13  PKSs  exit  in  nature  (5,  32,  34).  Type  I  PKSs  are  multifunctional  enzymes  organized  into 

14  modules,  each  of  which  harbors  a  set  of  distinct  domains  responsible  for  the  catalysis  of  one 

1 5  cycle  of  polyketide  chain  elongation.  A  prototypical  type  I  PKS  elongation  module  contains 

16  minimally  three  integral  domains  -  a  ketosynthase  (KS),  an  acyltransferase  (AT)  and  an  acyl 

17  carrier  protein  (ACP)  -  that  together  catalyze  one  round  of  polyketide  chain  elongation.  Optional 

18  domains  (such  as  ketoreductase  [KR],  dehydratase  [DH],  enoylreductase  [ER])  are  found 

19  between  the  AT  and  ACP  domains,  which  carry  out  variable  steps  of  reductive  modifications  on 

20  polyketide  intermediate.  Type  I  PKSs  can  be  subdivided  into  iterative  and  noniterative, 

2 1  depending  on  whether  the  same  set  of  domains  is  used  once  or  multiple  times  during  the 

22  biosynthesis  of  a  polyketide  molecule.  Type  II  PKSs  are  multienzyme  complexes  that  carry  a 

23  single  set  of  catalytic  domains  acting  iteratively.  Type  III  PKSs  are  chalcone  synthase  (CHS)- 
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1  like  enzymes  that  essentially  are  iteratively  acting  condensing  enzymes.  FASs,  and  type  I  and 

2  type  II  PKSs  all  use  ACPs  to  tether  acyl  CoA  substrates  during  fatty  acid  or  polyketide 

3  biosynthesis,  whereas  type  III  PKSs  directly  condense  acyl  CoA  substrates  without  carrier 

4  proteins.  In  parallel,  NRPSs  use  peptidyl  carrier  proteins  (PCPs)  to  tether  aminoacyl  CoA 

5  substrates  during  nonribosomal  peptide  biosynthesis.  Prior  to  biosynthesis,  apo  form  of  ACPs  or 

6  PCPs  must  be  activated  to  holo  form  by  attaching  a  4’-phosphopantetheinyl  moiety  from  CoA 

7  onto  a  conserved  serine  residue;  this  posttranslational  modification  reaction  is  catalyzed  by 

8  phosphopantetheinyltransferases  (PPTases)  that  can  be  largely  classified  into  two  groups,  AcpS- 

9  type  and  Sfp-type  (12,  21,  26). 

10  Integral  AT  domains  in  the  prototypical  type  I  PKSs  were  termed  cognate  ATs  (8).  A  distinct 

1 1  variant  of  the  noniterative  type  I  PKSs  contains  no  intact  cognate  ATs  but  a  short  segment  of 

12  remnant  AT  sequence  in  some  or  all  modules.  This  subclass  of  type  I  PKSs  was  named  the  “AT- 

13  less”  type  I  PKSs  and  the  remnant  AT  segment  the  AT  docking  domain  (7,  8,  36).  The  essential 

14  AT  activities  are  provided  in  trans  by  discrete  AT  enzymes  encoded  by  genes  that  are  physically 

1 5  separated  from  the  PKS  genes,  but  nevertheless  within  the  same  gene  cluster. 

16  Two  unusual  natural  product  biosynthetic  gene  clusters  responsible  for  the  biosynthesis  of 

17  actinorhodin  (28)  or  tetracenomycin  (4,  17,  35)  encode  type  II  PKSs  that  lack  an  AT  subunit  and 

18  no  AT-encoding  gene  is  present  within  the  gene  cluster  boundaries.  The  AT  activity  necessary 

19  for  polyketide  chain  extension  is,  instead,  provided  by  a  malonyl  CoA  acyltransferase  (MCAT, 

20  also  known  as  FabD,  encoded  by  the  fabD  gene)  of  type  II  fatty  acid  synthase  complex  (FAS  II). 

2 1  Recruitment  of  an  FAS  II  component  for  polyketide  biosynthesis  establishes  a  functional  cross- 

22  talk  between  the  bacterial  primary  metabolism  and  secondary  metabolism. 

23  FK228  is  an  interesting  bicyclic  depsipeptide  natural  product  recently  approved  by  the  US 
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1  Food  and  Drug  Administration  (FDA)  as  a  new  class  of  anticancer  drug  for  the  treatment  of 

2  cutaneous  T-cell  lymphoma  (CTCL)  (1).  FK228  epigenetically  affects  tumorigenesis  by 

3  inhibiting  histone  deacetylases  (18).  Our  research  interests  reside  on  the  mechanism  by  which 

4  FK228  is  biosynthesized  and  on  metabolic  engineering  of  the  FK228  biosynthetic  pathway  for 

5  generating  structural  analogs  for  drug  discovery.  Previously  we  have  reported  cloning  of  the 

6  FK228  biosynthetic  ( dep )  gene  cluster  and  characterization  of  a  critical  gene  ( depH)  responsible 

7  for  disulfide  bond  formation  as  the  final  step  in  FK228  biosynthesis  in  Chromobacterium 

8  violaceum  no.  968  (9,  37).  In  a  companion  paper  we  further  described  a  precise  determination  of 

9  the  dep  gene  cluster  boundaries  through  systematic  gene  mutation  and  transcriptional  analysis 

10  (27).  As  a  result,  the  dep  gene  cluster  is  now  redefined  to  contain  12  genes:  depA  through  depJ, 

1 1  depM,  and  a  newly  identified  pathway  regulatory  gene  depR. 

12  There  are  still  several  unanswered  questions  regarding  the  FK228  biosynthetic  pathway.  In 

13  particular,  the  two  PKS  modules  on  DepBC  enzymes  do  not  contain  any  cognate  AT  domain  but 

14  a  remnant  AT  docking  domain  (Fig.  lc),  and  there  is  no  AT-encoding  gene  anywhere  in  the 

15  defined  dep  gene  cluster  (9,  27).  Where  does  the  AT  activity  necessary  for  FK228  biosynthesis 

16  come  from?  In  this  work  we  report  the  identification  of  two  essential  genes,  fabDl  and  fabD2, 

17  both  encoding  a  putative  MCAT  component  of  the  FAS  II  complex,  that  are  positively  involved 

18  in  FK228  biosynthesis.  Concurrently  we  identified  a  gene  ( sfp )  encoding  a  putative  Sfp-type 

19  PPTase  necessary  for  FK228  biosynthesis  as  well.  Furthermore,  we  show  that,  through 

20  reconstitution  of  the  FK228  biosynthesis  pathway,  FK228  could  be  produced  by  recombinant  E. 

21  coli  strains  under  both  aerobic  and  anaerobic  cultivation  conditions.  Our  studies  revealed,  for  the 

22  first  time,  that  modular  PKSs  recruit  FabD  components  of  the  primary  metabolism  for  the 

23  biosynthesis  of  a  secondary  metabolite.  Discovery  of  the  trans  complementation  of  modular 
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1  PKSs  by  housekeeping  ATs  reveals  natural  product  biosynthesis  diversity.  Moreover, 

2  demonstration  of  anaerobic  production  of  FK228  by  an  engineered  facultative  bacterial  strain 

3  validates  our  effort  toward  engineering  of  novel  tumor- targeting  bio-agents  (6). 

4 

5  MATERIALS  AND  METHODS 

6  Bacterial  strains,  plasmids,  culture  conditions,  and  general  molecular  biological 

7  manipulations.  The  bacterial  strains  and  plasmids  used  in  this  study  are  listed  in  Table  1. 

8  Culture  conditions  and  general  molecular  biological  manipulations  were  performed  as  described 

9  (9,  37),  or  according  to  standard  protocols  (30). 

10  Rapid  genome  sequencing  and  gene  identification.  Genomic  DNA  of  the  wild  type  C. 

1 1  violaceum  no.  968  strain  was  prepared  from  an  overnight  culture  with  an  UltraClean  Microbial 

12  DNA  Isolation  kit  (MO  BIO  Labs,  Carlsbad,  CA),  and  was  submitted  for  shotgun  single-end  and 

13  paired-end  shotgun  sequencing  on  a  G20  FLX  platform  (454  Life  Science,  Branford,  CT)  at  the 

14  Research  Technology  Support  Facility  of  Michigan  State  University  (East  Lansing,  MI). 

15  Standard  de  novo  assembly  of  sequence  reads  was  performed  by  the  facility  personnel  and  a  draft 

16  genome  sequence  was  provided  to  us  as  the  end  users.  Candidate  genes  were  identified  by  using 

17  known  protein  sequences  as  bait  to  search  the  draft  genome  sequence  of  C.  violaceum  by  Blastx 

18  algorithm  (3). 

19  General  strategies  for  targeted  gene  deletion  in  C.  violaceum  no.  968.  A  multiplex  PCR 

20  method,  as  described  elsewhere  (10,  27,  37),  was  used  for  all  intended  gene  deletion 

21  experiments.  This  method  utilized  a  broad  host-range  Flp-FRT  recombination  system  for  site- 

22  specific  gene  replacement/deletion  and  subsequent  marker  removal  (10,  19).  Primers  used  for 

23  making  gene  deletion  constructs  and  for  detection  of  genotypes  are  listed  in  Table  SI. 
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1  Reconstitution  of  FK228  biosynthetic  gene  cluster  in  engineered  E.  coli  strains.  To  probe 

2  whether  individual  candidate  AT-encoding  genes  or  PPTase-encoding  genes  are  involved  in 

3  FK228  biosynthesis,  a  three-plasmid  system  was  utilized  for  gene  expression  and  FK228 

4  biosynthesis  in  bacterial  strains  derived  from  E.  coli  BL21(DE3)  (Fig.  2a).  First,  all  candidate 

5  genes  were  amplified  by  high  fidelity  PCR  from  the  genomic  DNA  of  C.  violaceum  with  primer 

6  sets  carrying  designed  restriction  sites.  Second,  two  candidate  PPTase-encoding  gene  amplicons 

7  were  double-digested  with  BamHI/Hindlll  and  individually  cloned  into  the  first  multiple  cloning 

8  site  (MCS)  of  pCDFDuet-1  to  create  two  intermediate  constructs,  pCDFDuct- 1  -acpS  and 

9  pCDFDuet-1  -sfp.  Third,  two  candidate  AT-encoding  gene  amplicons  were  double-digested  with 

10  Ndel/Kpnl  and  individually  cloned  into  the  second  multiple  cloning  site  of  pCDFDuet-1  to 

1 1  generate  pCDFDuet-1  -fabDl  and  pCDFDuet-1  -fabD2,  or  into  the  two  previously  made 

12  intermediate  constructs  to  create  a  combination  of  4  final  constructs,  pCDFDuet-1  -acpS-fabDl, 

13  pCDFDuet-1  -acpS-fabD2,  pCDFDuet-1  -sfp-fabDl,  pCDFDuet-1  -sfp-fabD2  (Table  1).  Finally, 

14  the  Cosmid  18,  which  carries  the  original  incomplete  FK228  biosynthetic  gene  cluster  that  lacks 

15  any  AT-encoding  gene  or  PPTase-encoding  gene  (Cheng  et  al,  2007),  and  pBMTL-3 -depR, 

16  which  was  created  to  complement  a  depR- deletion  mutant  of  C.  violaceum  (27),  were  used  in 

17  combination  with  the  above  expression  constructs  for  the  transformation  of  E.  coli  BL2 1(DE3) 

18  cells  to  create  a  series  of  bacterial  strains  (Table  1).  Kanamycin  at  25  pg/ml,  chloramphenicol  at 

19  25  pg/ml,  and  streptomycin  at  25  pg/ml  were  used  individually  or  in  combination  for  selection 

20  and  maintenance  of  respective  E.  coli  strains. 

21  Bacterial  fermentation  and  quantification  of  FK228  production  by  liquid 

22  chromatography-mass  spectrometry  (LC-MS).  Wild  type  C.  violaceum  strain  and 

23  recombinant  E.  coli  strains  were  fermented  aerobically  for  4  days  at  30°C  under  constant 
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1  agitation  (200  rpm)  in  50  ml  of  LB  media  supplemented  with  1%  (w/v)  Diaion  HP-20  resin 

2  (Sigma- Aldrich,  St.  Louis,  MO)  and  appropriate  antibiotics  where  necessary.  Gene  expression 

3  was  induced  with  0.5%  (w/v)  lactose  and  0. 1  mM  IPTG  when  bacterial  culture  reached  an  OD6oo 

4  of  0.4.  Strict  anaerobic  fermentation  of  bacterial  strains  was  carried  out  similarly  for  5  days  at 

5  room  temperature  in  a  Coy  anaerobic  chamber  (Grass  Lake,  MI)  with  occasional  manual 

6  agitation,  except  for  that  0.05%  (w/v)  thioglycolate  was  added  to  the  media  to  capture  any 

7  oxidative  species  generated  during  fermentation.  Extraction  of  metabolites,  and  detection  and 

8  quantification  of  FK228  by  LC-MS  were  performed  as  described  (27). 

9  RNA  extraction  and  reverse  transcription  (RT)-PCR.  Recombinant  E.  coli  BL21(DE3) 

10  strains  were  grown  in  LB  media  supplemented  with  appropriate  antibiotics  at  30°C  under 

1 1  constant  agitation  (200  rpm)  to  an  ODeoo  of  0.4.  Five  ml  of  each  pre-induction  sample  was 

12  collected  and  the  remaining  cultures  were  induced  with  0.5%  (v/v)  lactose  and  0.1  mM  IPTG. 

13  Aliquots  of  sample  were  collected  at  60  min  and  again  at  120  min  post  induction.  Preservation  of 

14  sample  aliquots,  extraction  of  total  RNA,  and  RT-PCR  experiments  were  performed  as  described 

15  (27).  Primers  used  for  detection  of  individual  gene  expression  are  listed  in  Table  SI. 

16  Nucleotide  sequence  accession  numbers.  The  nucleotide  sequences  of  C.  violaceum  no.  968 

17  genes  reported  in  this  paper  have  been  deposited  in  the  GenBank  database  under  accession 

18  numbers  HM449690  for  16S  rRNA  gene,  HM449691  for  fabDl,  HM449692  for  fabD2, 

19  HM449693  for  A  T3,  HM449694  for  acpS,  and  HM449695  for  sfp,  respectively. 

20 

21  RESULTS 

22  Draft  genome  sequencing  of  Chromobacterium  violaceum  no.  968.  Shotgun  sequencing  of 

23  C.  violaceum  genomic  DNA  on  a  GS20  FLX  Sequencer  generated  a  total  of  163,954,650  input 
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1  bases,  which  were  assembled  into  122  contigs;  among  them  82  are  large  contigs  (>500  nt)  with 

2  an  average  contig  size  of  59,5 14  bps.  Those  contigs  were  further  aligned  into  15  scaffolds  with  a 

3  total  length  of  4,909,141  bps.  The  sequence  coverage  for  this  draft  bacterial  genome  was  thus 

4  calculated  at  33.4  folds,  which  exceeded  the  desired  30-fold  oversampling  of  raw  sequence  for 

5  the  454  pyrosequencing  and  de  novo  assembly  technology  platform  (13).  Compared  to  the 

6  published  4.75-Mb  complete  genome  of  a  type  strain  of  C.  violaceum  ATCC  12472  with  a  (G  + 

7  C)  content  of  64.8%  (11),  the  4.91-Mb  draft  genome  sequence  of  C.  violaceum  no.  968  with  an 

8  overall  (G  +  C)  content  of  61.9%  obtained  in  this  study  appears  to  be  near  complete.  The  quality 

9  of  this  draft  genome  sequence  was  assessed  by  the  following  analyses.  First,  a  comparison  of  the 

10  16S  rDNA  sequences  of  two  C.  violaceum  strains  revealed  an  overall  96%  identity  without  a 

1 1  single  gap  (Fig.  SI),  indicating  a  high  quality  of  the  draft  genome  sequence  and  a  taxonomical 

12  relatedness  of  the  two  strains.  Second,  a  homology  search  by  Blastn  algorithm  of  the  draft 

13  genome  sequence  using  our  previously  published  FK228  biosynthetic  gene  cluster  sequence 

14  (GenBank  no.  EF210776)  (9)  as  bait  identified  two  contigs  that  carry  the  gene  cluster  with  a 

15  100%  sequence  identify  and  with  a  1298-bp  sequence  gap  (data  not  shown).  This  gap  was 

16  artificially  created  due  to  two  highly  homologous  regions  within  the  gene  cluster  that  were 

17  assembled  into  one  copy  of  sequence. 

18  Identification  and  initial  characterization  of  candidate  genes.  Sequences  of  three  bait 

19  proteins,  FabD  of  E.  coli  K-12  (GenBank  no.  AAC74176),  FabD  of  C.  violaceum  ATCC  12472 

20  (GenBank  no.  NP  903085)  and  LnmG  of  Streptomyces  atroolivaceus  S-140  (GenBank  no. 

21  AAN85520),  were  used  to  search  the  draft  genome  sequence  and  identified  three  candidate 

22  genes,  fabD  1 ,  fabD2  and  AT3,  that  encode  putative  AT  enzymes  (FabD,  MCAT  or  AT)  (Table  2; 

23  Fig.  lb).  Based  on  bioinfonnatics  analysis,  the  fabDl  gene  lies  within  an  apparent  FAS  II  gene 
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1  cluster  that  also  includes  fabH,fabG,  acp  and  fabF.  The  fabD2  gene  is  a  standalone  gene  whose 

2  function  cannot  be  predicted  a  priori.  The  AT3  gene  lies  within  a  putative  gene  cluster  that  may 

3  involve  in  cell-surface  O-antigen  biosynthesis. 

4  Similarly,  sequences  of  five  bait  proteins,  AcpS  of  E.  coli  K-12  (GenBank  no.  P24224),  Sfp 

5  of  Bacillus  subtilis  (GenBank  no.  P39135),  AcpS  of  C.  violaceum  ATCC  12472  (GenBank  no. 

6  NP_901742),  EntD  of  C.  violaceum  ATCC  12472  (GenBank  no.  NP_902320)  and  PcpS  of 

7  Pseudomonas  aeruginosa  PAOl  (GenBank  no.  AAG04554),  were  used  to  search  the  draft 

8  genome  sequence  and  identified  two  candidate  genes,  acpS  and  sfp,  that  encode  putative  PPTase 

9  enzymes  (AcpS;  Sfp)  (Table  2;  Fig.  la).  The  acpS  gene  appears  to  be  a  standalone  gene  which  is 

10  likely  involved  in  primary  metabolite  (e.g.  fatty  acid)  biosynthesis.  The  sfp  gene  is  located  at  the 

1 1  end  of  an  apparent  NRPS  gene  cluster,  therefore  it  is  likely  involved  in  secondary  metabolite 

12  (e.g.  nonribosomal  peptides  or  hybrid  molecules)  biosynthesis. 

13  Individual  candidate  genes  were  subjected  to  targeted  gene  deletion  by  an  established 

14  multiplex  PCR  procedure  to  probe  whether  they  play  any  roles  in  FK228  biosynthesis.  Gene  AT3 

1 5  was  successfully  mutated  and  the  mutant  strain  did  not  show  any  notable  difference  from  the 

16  wild  type  strain  (data  not  shown),  suggesting  that  A  T3  is  disposable  and  is  independent  of  FK228 

17  biosynthesis.  Thus  A  T3  was  not  further  tested.  In  contrast,  four  other  candidate  genes ,fabDl, 

18  fabD2,  acpS  and  sfp,  could  not  be  mutated  despite  numerous  attempts,  indicating  that  they  are  all 

19  essential  to  the  bacterial  physiology  and  survival.  They  were  then  subjected  to  tests  by  a  different 

20  strategy  described  below. 

21  Reconstitution  of  the  FK228  biosynthetic  pathway  in  E.  coli.  A  series  of  recombinant  E. 

22  coli  strains  were  created  herein  (Table  1;  Fig.  2a)  and  the  relative  levels  of  FK228  production  by 

23  these  strains  were  examined  by  LC-MS  (Fig.  3;  Table  S2;  Fig.  S2).  When  Cosmid  18,  the 
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1  originally  identified  large  construct  that  carries  the  dep  gene  cluster  which  definitely  lacks  an 

2  AT-encoding  gene  and  a  PPTase-encoding  gene  (9),  was  introduced  into  E.  coli  BL21(DE3) 

3  cells,  the  recombinant  strain  SW01  failed  to  produce  FK228.  When  a  newly  defined  pathway 

4  regulatory  gene,  depR  (27),  was  introduced  into  SW01  via  a  broad-host  range  construct  pBMTL- 

5  3-depR,  the  resulting  strain  SW02  still  did  not  produce  any  detectable  level  of  FK228.  When  the 

6  candidate  sfp  gene  was  introduced  into  SW02  via  a  compatible  construct  pCDFDuct- 1  -sfp,  the 

7  resulting  strain  SW03  produced  a  moderate  level  of  FK228.  When  either  the  candidate  fabDl  or 

8  fabD2  gene  was  added  in  tandem  with  sfp  on  pCDFDuct- 1  -sfp-fabDI  or  pCDFDuct- 1  -sfp-fabD2 

9  construct,  the  resulting  strains  of  SW07  and  SW08  produced  significant  levels  of  FK228.  All 

10  other  strains  that  received  a  single  candidate  gene  of  either  acpS  (strain  SW04),  fabDl  (strain 

1 1  SW05)  or  fabD2  (strain  SW06),  or  two  genes  with  acpS  in  tandem  with  either  fabDl  (strain 

12  SW9)  or  fabD2  (strain  SW10)  did  not  show  production  of  detectable  level  of  FK228.  Those 

13  observations  led  to  the  following  conclusions:  (1)  the  sfp  gene  but  not  the  acpS  gene  is  capable 

14  of  converting  all  PCPs  and  ACPs  in  the  FK228  biosynthetic  pathway  from  their  inactive  apo 

15  form  to  their  active  holo  form;  (2)  indigenous  E.  coli  AcpS  cannot  promiscuously  act  on  those 

16  heterologous  carrier  proteins  from  a  secondary  metabolic  pathway;  (3)  either  gene  product  of  the 

17  essential  fabDl  or  fabD2  gene  of  C.  violaceum  no.  968  is  able  to  complement  the  “AT-less”  PKS 

18  modules  on  DepBC  for  polyketide  chain  extension;  (4)  even  E.  coli  FabD  (in  the  background  in 

19  case  of  strain  SW03)  is  able  to  provide  the  necessary  AT  activities  for  FK228  biosynthesis  albeit 

20  at  a  lower  rate;  (5)  at  this  point  we  believe  that  genes  necessary  for  FK228  biosynthesis  have  all 

21  been  identified. 

22  RT-PCR  verification  of  gene  expression.  To  verify  that  key  biosynthetic  and  regulatory 

23  genes  for  FK228  biosynthesis  were  adequately  expressed  in  the  SW07  recombinant  strain  under 
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1  normal  aerobic  conditions,  aliquots  of  bacterial  culture  were  collected  at  three  time  points,  total 

2  RNA  samples  were  prepared  and  subjected  to  semi-quantitative  RT-PCR  analysis  (Fig.  2b).  Prior 

3  to  chemical  induction,  all  examined  genes  including  two  representative  structural  genes  ( depA 

4  and  depJ)  but  excluding  the  16S  rDNA  control  were  not  expressed.  This  suggests  that  the  depR 

5  gene  in  its  native  position  on  Cosmid  18  is  not  functioning  in  E.  coli  cells,  likely  due  to  a  lack  of 

6  proper  external  or  internal  stimulus,  or  signal  transduction  pathway;  expression  of  the  dep  gene 

7  cluster  carried  by  Cosmid  18  in  E.  coli  cells  requires  an  ectopic  copy  of  depR  driven  by  lactose- 

8  inducible  promoter  on  a  pBMTL-3  vector.  Upon  induction  by  lactose  and  IPTG  for  the 

9  expression  of  depR  from  pBMTL-3 -depR  and  for  the  expression  of  sfp  and  fabDl  from 

10  pCDFDuet-1  -sfp-fabDl,  those  genes  and  two  representative  structural  genes  (dep A  and  depJ) 

1 1  were  found  to  have  expressed  60  min  post  induction;  and  the  gene  expression  reached  higher 

12  levels  120  min  post  induction. 

13  Production  of  FK228  under  anaerobic  fermentation  conditions.  As  a  pilot  study  toward 

14  engineering  of  novel  tumor- targeting  bacterial  agents  that  may  effectively  infiltrate,  multiply  and 

15  continuously  produce  an  anticancer  drug  inside  the  hypoxic  core  of  solid  tumors  (6),  we 

16  examined  whether  the  engineered  SW07  strain  may  produce  FK228  under  anaerobic 

17  fermentation  conditions.  To  our  delight  the  SW07  strain  produced  c.a.  14.1%  as  much  FK228  as 

18  did  by  the  wild  type  C.  violaceum  strain  or  15.3%  by  the  SW07  strain  itself  under  aerobic 

19  conditions  (Fig.  3;  Table  S2;  Fig.  S2);  this  relative  level  of  FK228  production  was  translated  into 

20  0.40  mg/L  of  actual  yield  or  c.a.  741  nM  concentration  when  normalized  by  FK228  standard.  As 

21  a  control,  the  wild  type  C.  violaceum  strain  was  also  found  to  have  produced  a  lower  level  of 

22  FK228  under  anaerobic  conditions.  This  Gram-negative  bacterial  species  appeared  to  be  able  to 

23  survive  and  grow  slightly  during  the  first  24  hr  under  the  strict  anaerobic  conditions  tested. 
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2  DISCUSSION 

3  Organizational  and  functional  variation  of  PKSs  leads  to  diverse  structures  of  polyketide 

4  natural  products  (5,  32).  The  modular  “AT-less”  PKSs  represent  a  severe  deviation  from  the 

5  canonical  type  I  PKSs  (7,  8)  and  this  phenomenon  appears  to  be  a  transition  state  of  complex 

6  enzyme  evolution  from  FAS  II  to  type  II  PKSs  (33).  Nevertheless  in  most  “AT-less”  PKS 

7  pathways  the  AT  activities  required  for  polyketide  chain  elongation  are  often  encoded  by 

8  discrete  genes  within  the  respective  biosynthetic  gene  clusters.  In  the  present  work,  for  the  first 

9  time,  we  uncovered  an  extreme  case  of  “AT-less”  PKS  system  where  the  AT  activity  is  encoded 

10  by  fabDs,  the  essential  bacterial  genes  involved  in  fatty  acid  biosynthesis.  Support  for  this 

1 1  conclusion  came  from  several  lines  of  evidence. 

12  First,  sequence  analysis  of  the  two  PKS  modules  on  DepBC  did  not  find  an  intact  integral  AT 

13  domain  but  remnant  of  AT-docking  segment  (9);  thus  DepBC  enzymes  can  be  classified  as 

14  modular  “AT-less”  PKSs.  Further  analysis  failed  to  identify  any  gene  within  or  in  the  vicinity  of 

15  the  redefined  dep  gene  cluster  (27).  To  address  the  mystery  about  where  the  necessary  AT 

16  activities  would  come  from,  the  genome  of  C.  violaceum  no.  968  was  decoded  by  a  rapid 

17  genome  sequencing  platform,  revealing  three  candidate  genes  that  may  encode  the  AT  activities 

18  for  FK228  biosynthesis.  However,  attempts  to  identify  the  exact  AT-encoding  gene  by  a  direct 

19  gene  deletion  approach  was  not  successful,  because  two  candidate  genes  (fabDl  and  fabDT) 

20  could  not  be  mutated  due  to  their  apparent  essential  roles  in  bacterial  viability.  A  third  gene 

21  (A  T3)  could  be  mutated  but  the  mutant  did  not  show  any  obvious  sign  of  physiological  defect  or 

22  decrease  in  FK228  production. 
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1  We  then  engineered  a  series  of  recombinant  E.  coli  strains  that  harbor  one  to  three 

2  compatible  vectors  that  carry  either  the  previously  identified  dep  gene  cluster  or  candidate  genes 

3  identified  in  this  work  with  three  objectives  in  mind  (Fig.  2a).  First,  we  hoped  to  identify  the 

4  missing  AT-encoding  gene  and  PPTase-encoding  gene  necessary  for  FK228  biosynthesis. 

5  Second,  we  hoped  to  reconstitute  a  functional  FK228  biosynthetic  pathway  in  a  heterologous 

6  host  such  as  E.  coli  cells.  Third,  we  hoped  to  demonstrate  that  FK228  could  be  produced  by  a 

7  recombinant  E.  coli  strain  under  both  aerobic  conditions  and  particularly  anaerobic  conditions  as 

8  a  pilot  experiment  for  genetic  engineering  of  novel  cancer-targeting  bio-agents  (6). 

9  Three  engineered  E.  coli  strains,  SW03,  SW07  and  SW08,  were  subsequently  found  to  be 

10  able  to  produce  variable  amounts  of  FK228  under  normal  (aerobic)  fennentation  conditions  (Fig. 

11  3;  Table  S2;  Fig.  S2).  In  addition  to  the  Cosmid  18  and  the  depR  gene  on  a  vector,  all  these  three 

12  strains  received  the  sfp  gene;  while  other  E.  coli  strains,  regardless  of  having  the  acpS  gene  or 

13  not,  did  not  produce  FK228.  It  was  thus  clear  that  the  AcpS-type  PPTase  from  either  E.  coli  host 

14  or  from  C.  violaceum  cannot  activate  the  carrier  proteins  from  the  FK228  biosynthetic  pathway. 

1 5  Therefore  it  was  concluded  that  the  sfp  gene,  which  was  predicted  to  encode  a  broad  substrate- 

16  range  Sfp-type  PPTase,  is  involved  in  FK228  biosynthesis.  This  PPTase  should  have  other 

17  essential  function  as  well,  otherwise  the  sfp  gene  could  have  been  mutated  in  the  first  place.  In 

18  addition,  two  of  the  three  strains,  SW07  and  SW08,  which  also  received  either  fabDl  or  fabD2 

19  gene  of  C.  vioalceum,  produced  much  higher  levels  of  FK228  than  did  by  SW03.  Those 

20  observations  suggested  that  either  fabDl  or  fabD2  of  C.  violaceum  is  involved  in  FK228  and 

21  their  involvement  is  interchangeable;  it  is  also  possible  that  both  genes  are  redundantly  involved 

22  in  FK228  biosynthesis.  Surprisingly,  the  indigenous  fabD  of  E.  coli  appeared  to  function  as  well 

23  for  FK228  biosynthesis  in  the  SW03  strain  which  received  the  sfp  gene  but  not  fabDl  or  fabD2. 
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1  Sequence  alignment  of  the  C.  violaceum  FabDl  and  FabD2,  and  E  .  coli  FabD  shows  that 

2  FabDl  and  FabD  share  a  much  higher  amino  acid  sequence  identity  than  other  two  sets  of 

3  comparison  (Fig.  S3),  suggesting  that  FabDl  is  very  likely  the  housekeeping  FAS  II  component 

4  for  bacterial  fatty  acid  biosynthesis.  The  role  of  FabD2  is  less  certain.  The  fabD2  gene  was 

5  positively  identified  to  be  involved  in  FK228  biosynthesis;  but  like  the  Sfp-type  PPTase,  FabD2 

6  should  have  other  essential  function  as  well,  otherwise  the  fabD2  gene  could  have  been  mutated 

7  in  early  experimental  attempts. 

8  Now  it  is  proven  that  an  MCAT  component  of  FAS  II  is  recruited  by  PKS  modules  on 

9  DepBC  for  FK228  biosynthesis,  which  establishes  a  functional  cross-talk  between  the  bacterial 

10  primary  metabolism  and  secondary  metabolism,  and  adds  new  evidence  to  the  phenomenon  of 

1 1  complex  enzyme  evolution  from  FAS  II  to  type  II  PKSs  (33).  Besides  the  cases  where  an  ACP 

12  component  of  FAS  II  is  recruited  by  a  type  II  PKS  system  for  the  biosynthesis  of  actinorhodin 

13  (28)  or  tetracenomycin  (4,  17,  35),  other  types  of  cross-talk  have  also  been  reported.  For 

14  example,  in  the  biosynthesis  of  quinoxaline  antibiotics  in  two  Streptomyces  strains,  an  ACP  from 

15  the  primary  FAS  II  is  recruited  by  a  standalone  condensation  domain  of  type  II  NRPS  to  form  an 

16  initiation  module  of  the  biosynthetic  pathway  (3 1).  Furthermore,  cross-talk  between  a  type  I  FAS 

17  (FAS  I)  for  primary  metabolism  and  a  secondary  FAS  I  for  HC-toxin  biosynthesis  had  also  be 

18  postulated  in  the  fungal  species  Cochliobo/us  carbonum  where  only  one  gene  encoding  the  />- 

19  subunit  of  FAS  I  was  found  in  the  secondary  metabolic  gene  cluster,  while  the  a-subunit  of  FAS 

20  I  necessary  for  HC-toxin  biosynthesis  might  be  recruited  from  the  primary  FAS  I  (2).  Recently 

21  an  opposite  example  was  reported  in  the  apicidin  biosynthetic  pathway  in  Fusarium  semitectum 

22  where  a  gene  encoding  the  a-subunit  of  FAS  I  is  present  in  the  apicidin  biosynthetic  gene  cluster 

23  while  the  /3-subunit  of  FAS  I  necessary  for  apicidin  biosynthesis  is  speculated  to  be  recruited 
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1  from  the  primary  FAS  I  (20). 

2  Lastly,  the  ability  by  which  the  engineered  SW07  strain  produced  0.4  mg/L  of  actual  yield  or 

3  c.a.  741  nM  concentration  of  FK228  under  strict  anaerobic  conditions  is  worth  highlighting  (Fig. 

4  3;  Table  S2;  Fig.  S2).  Aimed  at  engineering  of  tumor- targeting  bio-agents  (6),  this 

5  accomplishment  represents  an  important  milestone  toward  the  goal.  The  next  step  will  be 

6  undertaken  to  introduce  the  FK228  biosynthetic  capacity  into  one  or  more  selected 

7  nonpathogenic  anaerobic  bacterial  species,  such  as  Bifidobacterium  longum  or  Clostridium 

8  oncolyticum  (25,  38),  so  that  when  the  engineered  bacteria  infiltrate  and  multiply  inside  the 

9  necrotic  region  of  solid  tumors,  FK228  will  be  produced  in  situ.  Because  FK228  is  a  potent 

10  HD  AC  inhibitor  and  a  new  anticancer  drug  effective  in  the  lower  nM  range  (22,  39),  the 

1 1  synergistic  efforts  of  bacterial  oncolysis  of  tumor  tissue  and  the  anticancer  activity  of  FK228 

12  could  potentially  be  clinically  effective  against  many  types  of  cancer. 

13 
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16 

17  FIGURE  LEGENDS 

18 

19  FIG  1.  Identification  of  missing  genes  necessary  for  FK228  biosynthesis  in  Chromobacterium 

20  violaceum  no.  968.  (a)  Local  genetic  map  of  two  candidate  PPTase-encoding  genes,  with  the  sfp 

2 1  gene  product  postulated  to  participate  in  phosphopantetheinylation  of  carrier  proteins  of  the 

22  FK228  biosynthetic  pathway,  (b)  Local  genetic  map  of  three  candidate  AT-encoding  genes,  with 

23  the  fabDl  and fabD2  gene  products  postulated  to  provide  AT  activities  to  complement  the  “AT- 

24  less”  PKS  modules  on  DepBC  proteins  for  FK228  biosynthesis,  (c)  Scheme  of  two  “AT-less” 
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1  PKS  modules  on  DepBC  proteins  that  require  a  PPTase  for  carrier  protein 

2  phosphopantetheinylation  and  a  trans  AT  for  polyketide  chain  elongation  in  FK228  biosynthesis. 

3  An  inactive  DH  domain  is  drawn  in  light  gray  and  labeled  as  DH1.  KS,  AT,  ACP,  DH,  KR  and 

4  PPTase  are  standard  abbreviations  of  domain/enzyme  names  that  have  been  described  in  the  text. 

5 

6  FIG  2.  Reconstitution  of  FK228  biosynthesis  in  E.  coli  cells,  (a)  Scheme  of  a  three-plasmid 

7  approach  for  reconstitution  of  FK228  biosynthetic  pathway  in  E.  coli  BL21(DE3)  cells.  Only  one 

8  cell  containing  one  copy  of  each  plasmid  is  drawn  into  picture  for  simplicity.  A  series  of 

9  engineered  strains  were  generated  with  different  combinations  of  plasmids  or  different  genes  on 

10  plasmids  (Table  1).  (b)  Examination  of  gene  expression  by  semi-quantitative  RT-PCR  in 

1 1  engineered  strain  SW07  cultivated  under  aerobic  conditions.  16S  rDNA  was  amplified  as  an 

12  internal  control. 

13 

14  FIG  3.  Levels  of  FK228  produced  by  recombinant  E.  coli  strains  relative  to  the  wild  type  strain 

15  of  Chromobacterium  violaceum  no.  968  cultivated  under  aerobic  conditions  unless  indicated  by 

16  *  for  under  anaerobic  conditions.  Data  are  mean  values  from  duplicate  experiments  with  error 

17  bars  indicating  standard  deviation.  Detailed  strain  information  and  FK228  levels  are  provided  in 

18  Table  1  and  Table  S2. 
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TABLE  1.  Bacterial  strains  and  plasmids  used  in  this  study 


Strains  or  plasmids 

Description3 

Source  or 

reference 

Chromobacterium  violaceum 

no.  968  (=FERM  BP- 1968) 

Wild  type  strain,  FK228-producing,  Apr  Thio' 

IPODb 

Escherichia  coli 

DH5a 

General  cloning  host 

Lab  stock 

S17-1 

Host  strain  for  interspecies  conjugation 

Lab  stock 

BL21(DE3) 

Host  strain  for  heterologous  gene  expression 

Novagen 

swot 

BL21(DE3)  harboring  Cosmid  18 

This  work 

SW02 

BL21(DE3)  harboring  Cosmid  18  and  pBMTL-3 -depR 

This  work 

SW03 

BL21(DE3)  harboring  Cosmid  18,  pBMTL-3  -depR  and  pCDFDuet- 

1  -sjp 

This  work 

SW04 

BL21(DE3)  harboring  Cosmid  18,  pBMTL-3  -depR  and  pCDFDuet- 

1  -acpS 

This  work 

SW05 

BL21(DE3)  harboring  Cosmid  18,  pBMTL-3  -depR  and  pCDFDuet- 

1-fabDl 

This  work 

SW06 

BL21(DE3)  harboring  Cosmid  18,  pBMTL-3 -depR  and  pCDFDuet- 

1  -fabD2 

This  work 

SW07 

BL21(DE3)  harboring  Cosmid  18,  pBMTL-3 -depR  and  pCDFDuet- 

1  -sfp-fabDl 

This  work 

SW08 

BL21(DE3)  harboring  Cosmid  18,  pBMTL-3 -depR  and  pCDFDuet- 

\-sfp-fabD2 

This  work 

SW09 

BL21(DE3)  harboring  Cosmid  18,  pBMTL-3 -depR  and  pCDFDuet- 

1  -acpS-fabDl 

This  work 

SW10 

BL21(DE3)  harboring  Cosmid  18,  pBMTL-3 -depR  and  pCDFDuet- 

1  -acpS-fabD2 

This  work 

Plasmids 

pGEM-3Zf 

Apr,  general  cloning  vector 

Promega 

pGEM-T  Easy 

Apr,  general  cloning  vector 

Promega 

Cosmid  18 

Apr,  Kanr,  cosmid  clone  containing  the  FK228  biosynthetic  gene 

cluster  (dep)  and  flanking  DNAs,  shotgun  sequenced 

(9) 

pBMTL-3 

Cmr,  pBBRl  ori,  broad  host-range  vector 

(23) 

pBMTL-3 -afe£>R 

Cmr,  depR  (with  RBS  from  pET29a)  cloned  into  pBMTL-3 

(27) 

pCDFDuet-1 

SnT,  CDF  ori ,  dual  expression  vector 

Novagen 

pCDFDuet-1-^ 

sjp  cloned  into  the  MCS1  region  of  pCDFDuet-1 

This  study 
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pCDFDuet-1 -acpS  acpS  cloned  into  MCS1  region  of  pCDFDuet-1  This  study 

pCDFDuet- 1-yaM)/  fabDl  cloned  into  MCS2  region  of  pCDFDuet-1  This  study 

pCDFDuet- 1  -fabD2  fabD2  cloned  into  MCS2  region  of  pCDFDuet-1  This  study 

pCDFDuet-1  -sfp-fabDl  sfp  cloned  into  MCS1 ,  fabDl  cloned  into  MCS2  of  pCDFDuet-1  This  study 

pCDFDuet- \-sfp-fabD2  sfp  cloned  into  VIC'S  1  ,fabD2  cloned  into  MCS2  of  pCDFDuet-1  This  study 

pCDFDuet-1  -acpS-fabDl  acpS  cloned  into  MCSl,  fabDl  cloned  into  MCS2  of  pCDFDuet-1  This  study 

pCDFDuet- 1  -acpS-fabD2  acpS  cloned  into  MCS1  ,fabD2  cloned  into  MCS2  of  pCDFDuet-1  This  study 


1  3  Apr,  ampicillin  resistance;  Cmr,  chloramphenicol  resistance;  Kanr,  kanamycin  resistance;  Smr,  streptomycin 

2  resistance;  Thio1,  thiostrepton  resistance.  MCS,  multiple  cloning  site. 

3  b  IPOD,  International  Patent  Organism  Depositary,  Tsukuba,  Japan. 

4 

5 

6 

7 

8  TABLE  2.  Candidate  genes  identified  through  rapid  genome  sequencing  and  their  involvement 

9  in  FK228  biosynthesis 


Gene  Name3 

Deduced  Product  and  Predicted  Function 

Gene 

Dispensability 

Involvement 

in  FK228 

Biosynthesis 

AT-encoding  genes 

fabDl 

FabDl,  likely  involved  in  fatty  acid 

biosynthesis 

No 

Positive 

fabD2 

FabD2,  function  unpredictable  a  priori 

No 

Positive 

AT 3 

AT3,  likely  involved  in  cell-surface  0- 

antigen  biosynthesis 

Yes 

No 

PPTase-encoding  genes 

acpS 

AcpS-type  PPTase,  likely  involved  in 

primary  metabolite  biosynthesis 

No 

No 

sfp 

Sfp-type  PPTase,  likely  involved  in 

secondary  metabolite  biosynthesis 

No 

Yes 

10  3  AT,  acyltransferase;  PPTase,  phosphopantetheinyltransferase. 
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FIG  1.  Identification  of  missing  genes  necessary  for  FK228  biosynthesis  in 
Chromobacterium  violaceum  no.  968.  (a)  Local  genetic  map  of  two  candidate 
PPTase-encoding  genes,  with  the  sfp  gene  product  postulated  to  participate  in 
phosphopantetheinylation  of  carrier  proteins  of  the  FK228  biosynthetic  pathway, 
(b)  Local  genetic  map  of  three  candidate  AT-encoding  genes,  with  the  fabDl  and 
fabD2  gene  products  postulated  to  provide  AT  activities  to  complement  the  “AT- 
less”  PKS  modules  on  DepBC  proteins  for  FK228  biosynthesis,  (c)  Scheme  of  two 
“AT-less”  PKS  modules  on  DepBC  proteins  that  require  a  PPTase  for  carrier 
protein  phosphopantetheinylation  and  a  trans  AT  for  polyketide  chain  elongation 
in  FK228  biosynthesis.  An  inactive  DH  domain  is  drawn  in  light  gray  and  labeled 
as  DH1.  KS,  AT,  ACP,  DH,  KR  and  PPTase  are  standard  abbreviations  of 
domain/enzyme  names  that  have  been  described  in  the  text. 
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120  min  post  induction 


FIG  2.  Reconstitution  of  FK228  biosynthesis  in  E.  coli  cells,  (a)  Scheme  of  a  three- 
plasmid  approach  for  reconstitution  of  FK228  biosynthetic  pathway  in  E.  coli 
BL21(DE3)  cells.  Only  one  cell  containing  one  copy  of  each  plasmid  is  drawn  into 
picture  for  simplicity.  A  series  of  engineered  strains  were  generated  with  different 
combinations  of  plasmids  or  different  genes  on  plasmids  (Table  1).  (b)  Examination  of 
gene  expression  by  semi-quantitative  RT-PCR  in  engineered  strain  SW07  cultivated 
under  aerobic  conditions.  16S  rDNA  was  amplified  as  an  internal  control. 
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FIG  3.  Levels  of  FK228  produced  by  recombinant  E.  coli  strains  relative  to  the  wild 
type  strain  of  Chromobacterium  violaceum  no.  968  cultivated  under  aerobic  conditions 
unless  indicated  by  *  for  under  anaerobic  conditions.  Data  are  mean  values  from 
duplicate  experiments  with  error  bars  indicating  standard  deviation.  Detailed  strain 
information  and  FK228  levels  are  provided  in  Table  1  and  Table  S2. 


